Picture for Guorong Li

Guorong Li

Exploring the Temporal Consistency for Point-Level Weakly-Supervised Temporal Action Localization

Add code
Feb 05, 2026
Viaarxiv icon

Boosting Point-supervised Temporal Action Localization via Text Refinement and Alignment

Add code
Feb 01, 2026
Viaarxiv icon

Towards Universal Modal Tracking with Online Dense Temporal Token Learning

Add code
Jul 27, 2025
Figure 1 for Towards Universal Modal Tracking with Online Dense Temporal Token Learning
Figure 2 for Towards Universal Modal Tracking with Online Dense Temporal Token Learning
Figure 3 for Towards Universal Modal Tracking with Online Dense Temporal Token Learning
Figure 4 for Towards Universal Modal Tracking with Online Dense Temporal Token Learning
Viaarxiv icon

SDVPT: Semantic-Driven Visual Prompt Tuning for Open-World Object Counting

Add code
Apr 24, 2025
Figure 1 for SDVPT: Semantic-Driven Visual Prompt Tuning for Open-World Object Counting
Figure 2 for SDVPT: Semantic-Driven Visual Prompt Tuning for Open-World Object Counting
Figure 3 for SDVPT: Semantic-Driven Visual Prompt Tuning for Open-World Object Counting
Figure 4 for SDVPT: Semantic-Driven Visual Prompt Tuning for Open-World Object Counting
Viaarxiv icon

P2Object: Single Point Supervised Object Detection and Instance Segmentation

Add code
Apr 10, 2025
Viaarxiv icon

The Devil is in the Distributions: Explicit Modeling of Scene Content is Key in Zero-Shot Video Captioning

Add code
Mar 31, 2025
Figure 1 for The Devil is in the Distributions: Explicit Modeling of Scene Content is Key in Zero-Shot Video Captioning
Figure 2 for The Devil is in the Distributions: Explicit Modeling of Scene Content is Key in Zero-Shot Video Captioning
Figure 3 for The Devil is in the Distributions: Explicit Modeling of Scene Content is Key in Zero-Shot Video Captioning
Figure 4 for The Devil is in the Distributions: Explicit Modeling of Scene Content is Key in Zero-Shot Video Captioning
Viaarxiv icon

Less is More: Token Context-aware Learning for Object Tracking

Add code
Jan 01, 2025
Viaarxiv icon

MambaLCT: Boosting Tracking via Long-term Context State Space Model

Add code
Dec 18, 2024
Figure 1 for MambaLCT: Boosting Tracking via Long-term Context State Space Model
Figure 2 for MambaLCT: Boosting Tracking via Long-term Context State Space Model
Figure 3 for MambaLCT: Boosting Tracking via Long-term Context State Space Model
Figure 4 for MambaLCT: Boosting Tracking via Long-term Context State Space Model
Viaarxiv icon

GaGA: Towards Interactive Global Geolocation Assistant

Add code
Dec 12, 2024
Viaarxiv icon

ClickTrack: Towards Real-time Interactive Single Object Tracking

Add code
Nov 24, 2024
Figure 1 for ClickTrack: Towards Real-time Interactive Single Object Tracking
Figure 2 for ClickTrack: Towards Real-time Interactive Single Object Tracking
Figure 3 for ClickTrack: Towards Real-time Interactive Single Object Tracking
Figure 4 for ClickTrack: Towards Real-time Interactive Single Object Tracking
Viaarxiv icon